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METHOD AND APARATUS FOR CHARACTERIZING 



THE QUALITY OF A NETWORK PATH 

Field 

The invention relates generally to paths of one or more computer 
5 networks, and particularly relates to metrics characterizing the network paths. 

Background 

The use of efficient routing algorithms, such as the bellman-ford 
algorithm and the open shortest path (OSP) algorithm is highly desirable in 
complex networks. Until today, these algorithms use metrics that are mostly 

1 0 static, and that are based on very coarse approximations of path performance 
quality (e.g., number of hops, user-defined static costs that are associated to 
each of the links in the network). As the Internet becomes more and more 
ubiquitous, metrics that characterize the quality of network applications running 
across network paths become more important. Metrics for voice and video have 

1 5 been devised in the past; however these metrics are complex and not adapted for 
routing, and instead can be used to report performance at select points in the 
network. 

Summary 

Metrics that at the same time are (1) additive and (2) characterize the 
20 performance of network applications are highly desirable, as they allow routing 

algorithms and devices that can take into account factors that matter to the users 

of these applications. 

We describe multiple methods and apparatuses for characterizing the 

quality of a network path by means of a metric. This metric characterizes a 
25 plurality of one or more network applications running across a network path. 

The quality characterization characterizes a quality of the same plurality of one 

or more network applications running at one or more end-points of the path. 

This metric is at least a function of a plurality of one or more elementary 

network parameters. The plurality of one or more network parameters include 
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one or more of delay, jitter, loss, currently available bandwidth, and intrinsic 
bandwidth. 

This metric is also additive, in the sense that given a network path that 
includes a first segment and a second segment, the metric score across the 
network path is equal to the sum of the metric score across the first segment, 
and the metric score across the second segment. 

Additional features, benefits, and embodiments of the present invention 
will become apparent from the detailed description, figures and claims. 

Brief Description of the Drawings 

Fig . 1 voice traffic: performance degradation (MOS - MOS mi »)/(MOS max 
- MOS min ) of some embodiments versus (a) percentage speech clipped and (b) 
RTT. 

Fig. 2 short TCP connections: performance degradation 
{Latency mini Latency) of some embodiments versus (a) the loss rate and (b) RTT. 

Fig. 3 typical file transfer: performance degradation 
{Throughput/Throughput,^ of some embodiments versus (a) the loss rate and 
(b) RTT. 

Fig. 4 MOS contours of some embodiments versus RTT and % speech 

lost. 

Fig. 5 normalized MOS degradation of some embodiments versus (a) 
speech lost and (b) one-way delay with negative exponentials. 

Fig. 6 shows an embodiment of approximating Latency m J Latency of 
some embodiments versus (a) one-way delay and (2) loss rate with negative ^ 
exponentials. 

Fig. 7 shows an embodiment of approximating Latency m J Latency of 
some embodiments versus (a) one-way delay and (2) packet loss with negative 
exponentials. 

Fig. 8 approximating ThroughputIThroughput max of some embodiments 
versus (a) one-way delay and (2) packet loss with negative exponentials. 



Fig. 9 approximating ThroughputlThroughput max of some embodiments 
versus (a) one-way delay and (2) packet loss with negative exponentials. 

Fig. 10 series of mappings involved in the translation of raw 
measurement traces into a web transaction score of some embodiments. 

Fig. 1 1 dynamics of a Simple TCP connection of some embodiments. 

Fig. 12 cwnd versus time for a TCP connection over of some 
embodiments (a) a high bandwidth path and (a) a low bandwidth path. 

Fig. 13 download of a web page which consists of one html file and nine 
images using HTTP/1 .0, with the Keep-Alive option either set or not of some 
embodiments. 

Fig. 14 download of a web page, which consists of one html file and 
nine images using HTTP/1 .1, using either persistent connections, or persistent 
and pipelined connections of some embodiments. 

Fig. 15 latency of a typical web transaction for different file sizes, given 
various loss rates versus one-way delay, using either HTTP/ 1.0 or HTTP/ 1.1 of 
some embodiments. 

Fig. 16 typical web transaction for different file sizes, given various 
round-trip times versus the loss rate, using either HTTP/ 1.0 or HTTP/ 1.1 of 
some embodiments. 

Fig. 17 latency of a typical web transaction for different file sizes, given 
various round-trip times versus one-way jitter, using either HTTP/1 .0 or 
HTTP/1.1 of some embodiments. 

Fig. 18 user's impression versus web transaction duration A. Bouch, N. 
Bhatti, and A. J. Kuchinsky, Quality is in the eye of the beholder: Meeting 
users' requirements for Internet Quality of Service. To be presented at 
CHI'2000. The Hague, The Netherlands, April 1-6, 2000, pp. 297-304.and A. 
Bouch, N. Bhatti, and A. J. Kuchinsky, Integrating User-Perceived Quality into 
Web Server Design, HP Labs Technical Report HPL-2000-3 20000121 of some 
embodiments. 

Fig. 19 impermissible rate (%) versus MOS, N. Kitawaki and K. Itoh, 
Pure Delay Effects on Speech Quality in Telecommunications, IEEE Journal of 
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Selected Areas in Communications, Vol. 9, No. 4, May 1991 of some 
embodiments. 

Fig. 20 metric of a typical web transaction for different file sizes, given 
various loss rates versus one-way delay, using either HTTP/ 1.0 or HTTP/ 1.1 of 
5 some embodiments. 

Fig. 21 metric of a typical web transaction for different file sizes, given 
various round-trip times versus the loss rate, using either HTTP/ 1.0 or 
HTTP/1.1 of some embodiments. 

Fig. 22 metric of a typical web transaction for different file sizes, given 
1 0 various round-trip times versus one-way jitter, using either HTTP/ 1 .0 or 
HTTP/1.1 of some embodiments. 

Fig. 23 network device of some embodiments deployed in a network. 

Fig. 24 network device of some embodiments deployed in an 
internetwork, at a point of presence between one or more networks. 

1 5 Fig. 25 network device of some embodiments used for routing deployed 

in an internetwork. 

Fig. 26 network device of some embodiments used for routing, deployed 
in an internetwork, at a point of presence between one or more networks. 

Fig. 27 shows some possible embodiments with devices that are 
20 communicating with each other, for example sending and receiving 
measurement packets. 

Fig, 28 shows one specific detailed embodiment with two devices, 
where each device is sending and receiving measurement packets as well as 
selecting a subset of paths. 
25 Fig. 29 shows an embodiment with more than two devices that are 

sending and receiving measurement packets to obtain measurements of 
performance characteristics of paths and to communicate measurements 
statistics about those paths. 
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Detailed Description 

We describe multiple embodiments of a method that translates 

elementary network parameters, wherein the plurality of one or more network 

parameters include one or more of delay, jitter, loss, currently available 

5 bandwidth, and intrinsic bandwidth into a metric that measures, at least in part 

quality characterizations of a same plurality of one or more network 

applications, wherein the quality characterization characterizes a quality of the 

same plurality of one or more network applications user-perceived quality 

metrics- In some embodiments of this invention, this metric captures the user- 

# 

10 perceived experience of the performance of an application. This metric is 
additive, in the sense that given: 

- a network path, including a first segment and a second segment; 

a first metric and the second metric, which are at least in part 
quality characterizations of a same plurality of one or more 
15 network applications, wherein the quality characterization 

characterizes a quality of the same plurality of one or more 
network applications running at one or more segment end-points 

wherein the first metric and the second metric are at least partly a 
function of a same plurality of one or more elementary network 
20 parameters, wherein the "plurality of one or more network 

parameters include one or more of delay, jitter, loss, currently 
available bandwidth, and intrinsic bandwidth 

wherein the first metric is at least partly the function of the same 
plurality of elementary network parameters of the first segment, 
25 wherein the one or more segment end points include one or more 

end-points of the first segment 

wherein the second metric is at least partly the function of the 
same plurality of elementary network parameters of the second 
segment, wherein the one or more segment end points include 
30 one or more end-points of the second segment, 
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adding the first metric and the second metric generates a third metric, wherein 

- the third metric is at least partly the function of the same 

plurality of one or more elementary network parameters of the 
network path, wherein the one or more segment end points 
5 include one or more end-points of the network path 

the third metric is a quality characterization of a same plurality 
of one or more applications 

In the process of designing such a metric, we first describe the insight that 
drives the general methodology, as well as the methodology specifics for voice, 
10 video, TCP traffic, HTTP/1 .0, and HTTP/1 . 1 respectively. Applying the 

described technique for voice, video, and data traffic, we derive embodiments of 
such an application specific metric for each type of application independently. 

General methodology 

Let's look at the general shape of performance degradation versus delay 

15 and loss curves for voice and TCP data traffic, where performance means the 
performance metric that matters for the respective application: MOS score for 
voice traffic, throughput and latency for TCP data traffic. (See Figs. 1-3.) 

In some embodiments of this invention, these curves are normalized. 
Normalizing the performance curves includes translating them to a 0-1 

20 performance score, where 1 represents no degradation, whereas 0 represents 
total degradation. Other embodiments can use other scales such as inverted 
scales from 1-0 and/or different scales such as 0-10. For example, in some 
embodiments of this invention, in one embodiment of the voice application, a 
MOS (Mean Opinion Score) of 4 is converted to a metric of 1 for voice (since 4 

25 is the maximum that can be attained in a telephone system), whereas a MOS 

score of 1 maps to a metric of 0 (that is, total degradation). On the other hand, in 
some embodiments of this invention, for general TCP applications, it is 
assumed that a metric of 1 for short TCP transactions translates into a fast, 
reliable response, whereas a metric of 0 represents "infinitely" slow response. 

30 Finally, in some embodiments of this invention, for TCP file transfer, a metric 
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of 1 corresponds to a high throughput, while a metric of 0 corresponds to a 
throughput that is close to 0. Note that in the context of TCP, a loss rate of 0 is a 
physically plausible phenomenon that gives the benchmark performance in 
terms of latency and throughput to which the performance achieved over lossy 
5 paths can be compared to. On the other hand, for some embodiments of this 
invention, a round-trip time of zero is less realistic, and leads in one 
embodiment with TCP traffic to both infinite throughput and zero latency as 
long as the loss rate is less than 100%. Hence, in some embodiments of this 
invention, the round-trip time for which the benchmark latency and throughput 

10 measures are computed, RTTo is larger than 0, and can be chosen according to 
the characteristics of the particular system in consideration. In some 
embodiments, the internetwork considered includes national networks spanning 
large geographical areas; in this context, a one-way delay of less than 25 ms is 
uncommon. Hence, in such embodiments, a RTTo = 50 ms can be appropriate. 

15 In some embodiments, the choice of RTTo is significant, since different values 
of RTTo lead to different shapes for the curves in Figure 2b and Figure 3b, 
which in turn lead to different metric parameters. 

The shape these curves is very similar between many embodiments. In 
some embodiments, the curves corresponding to voice (Fig. 1) have a shape that 

20 can be approximated by a negative exponential. In some embodiments, the 
curves corresponding to TCP applications (Figs. 2-3) are hyperbolic in p and 
RTT; in some embodiments of this invention, a hyperbolic function can be 
approximated, for a portion of the parameters' ranges with an exponential 
function. The related issues are dealt with later in the appendix, in the section 

25 reserved for TCP applications. 

In some embodiments, it is assumed that these curves can be fitted by 
negative exponential functions in some portion of the parameters' ranges. In 
some embodiments, in both the case of voice and data traffic, it is also assumed 
that one-way delay is half the round-trip time delay, so performance degradation 

30 versus one-way delay curves can be obtained. The same theory can apply to 
embodiments of voice and/or TCP traffic. For simplicity, in the remainder of 
this section, we describe an embodiment in the context of voice traffic. Hence, 
in this embodiment, the following equations estimate performance degradation 
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versus one-way delay and loss, m v< j and m v i, respectively, corresponding to 

voice traffic: 

m vD = exp(-a v £>) 

w v/ = exp(-/? v /) 

5 

Delay-loss MOS contours have not, to the extent of our knowledge, been 
studied extensively in previous subjective studies of voice quality. Hence, in 
some embodiments, assumptions are made in the way a metric m v combining 
m v o and m v i can be obtained from both metrics' equations. In one embodiment, 
10 one intuitively appealing technique that can be used to combine m vD and m v i can 
be used. Given the equations above, performance is close to perfect when both 
m V D and m v / are close to 1 . That is, we have 

(m v o = 1 and m v i = 1) => m v = 1 

15 On the other hand, note that if either m vD or m v i are close to 0, then the quality as 
perceived by the user will be bad, i.e., m v must be close to 0. Hence, the second 
relation between m v £>, m v / and m v ought to be 

(m v & = 0 or m v i = 0) ■=> m v = 0 

20 

One operator that satisfies both relations above is x. In this embodiment, we use 
this operator; the resulting metric for voice becomes 

™ v = m vo x Wrf = ex p(" <z v D)x exp(- J3 v l)= exp(- a v D - pj) 

25 

According to the obtained metric, equal metric contours in the (A /) plane are 
straight lines (See Fig. 4.) 

Note that since the exponential function is monotonic, the metric is also 
30 additive, in the sense described above, (which we denote MS in this document) 
is equal to 
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metric = D + — / 

Defining 5 V to be equal to p v /a v , we get 



metric = D + SJ 

5 

In this embodiment, the metric has two properties: on one hand, it can be 
simply modified using a single parameter 5 V . (Other embodiments can use more 
than one parameter.) On the other hand, it is additive, in the sense described 
above. This characteristic can be useful in the context of routing; specifically, in 

10 some embodiments, this metric can be used for routing, using algorithms such 
as Distance Vector and SPF (Shortest Path First). That is, using such a metric, 
dogleg routing can be easily implemented. In the following, we justify this 
property in the context of this specific embodiment: assume a path includes two 
links, L\ and Z 2 . The one-way delay and loss across link L\ are denoted D\ and 

15 t\ 9 while the delay and loss across link L 2 are denoted D 2 and h> respectively. 
We also define metric\ and metric^ to be the resulting metrics on link L\ and 
link L 2 , respectively, while the total metric metric is set to metric] + metic 2 = D\ 
+ D2 + 5v (J\ + h)- In this particular example, we use the one-way delay equation 

£>,=<i,+4v„ / = 1,2 

20 Clearly, D = D\ + D 2 is only an approximation of the delay on the link 

since in some embodiments, jitter is not additive. However, in some 
embodiments, the total jitter across a path is smaller or equal to the sum of jitter 
values for all segments on the path. Hence, in these embodiments, D\ + D 2 is a 
higher bound on the actual delay D over the link. In other words, setting the 

25 delay for the total path to D\ + D 2 , a 2 hop path is effectively penalized over a 
one hop path. In some embodiments, this is intuitively justifiable, since using 
two hops as opposed to one hop exhibits potentially extra overhead which is not 
taken into account by the delay metric, such as the potential additional delay 
incurred at the node connecting the two hops, and other reliability issues in 

30 terms of routing and connectivity. Hence, in such embodiments, it appears 
reasonable to penalize routes with a higher hop count. 
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On the same line of thought, let's derive the total loss rate on the path. In 
some embodiments, the losses on Links L\ and L2 can be assumed to occur 
independently; in such embodiments, then the total loss can be found using 

l-/ = (l_/,)(l-/ 2 ) 

5 / = /i + /2-/1/2 

That is, the actual loss on the path is lower than the value l\ + h assumed by the 
additive nature of the metric. In fact, in embodiments where the losses on links 
L\ and L2 are correlated, the actual loss is generally lower than l\+h — hh- 
However, first note that for the range of loss rates of interest to some 

10 embodiments, (say 1-10%), l\h is much smaller than l\ or h (for example, for l x 
= 10% and h = 10%, then l\h is a mere 1%). Hence, in such embodiments, l\l 2 
can safely be ignored from the equation for /. Furthermore, in some 
embodiments, the same argument can be set forth, as was made for the 
computation of delay, namely that penalizing the route that has a larger hop 

1 5 count is justifiable from a design point of view. 

The arguments described above demonstrate the adequacy and 
practicality of the metric metric = D + 5/ for some embodiments. In the sections 
below, the theory described above, which applies to some embodiments of this 
invention, is used in the specific contexts of voice and TCP data traffic, 

20 respectively, leading to some embodiments of this invention for voice and TCP. 
In some embodiments, it is useful to have a value of 8 that is adequate both for 
voice and TCP. In some embodiments the value of 5 is common for both voice 
and data traffic. 

25 Metric for voice traffic 

In this section, we approximate the normalized MOS degradation curves 
shown in Fig. 1 with negative exponentials. The results are shown in Fig. 5. 
Focusing first on Figure 5a, we can see that fitting the curve with an exponential 
leads to either an under-estimation of the degradation for low speech loss (that 

30 is, less than 4% loss rates) or to an over-estimation of the degradation for high 
loss rates (e.g., 10% loss rates and above). In order to pick an appropriate fit, 
one must remember that the curve shown in Fig. 5a assumes no error resiliency 
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at all. That is, a lost packet is replaced with silence. However, most modern 
voice decoders have features that enable some sort of error resiliency, in such a 
way that the effect of small clips can be significantly mitigated. As a result, one 
might expect that in actual modern voice encoders, the MOS degradation 
corresponding to low speech lost is not as high as that shown in Figs, la and 
Fig. 5a. In addition, it is clear that the efficiency of such error-resilient decoders 
gets lower as the packet loss increases; for loss rates exceeding 10%, no error 
correction can overcome the degradation caused by the amount of information 
lost. Hence, as shown in Fig. 5a, we fit the degradation curves with negative 
exponentials that underestimate the degradation for low speech loss (that is, for 
values of / lower than 4%), and matches the degradation for values of / 
exceeding 6%. We get 

exp(- 25/) < m Yl < exp(- 20/) 
with an average of 

m v / = exp(-23/) 

Similarly, we fit negative exponentials to the curves representing MOS 
degradation versus one-way delay D (obtained from the MOS versus RTT 
curves using the simple D = RTT/2 relation). (See Figure 5b.) Here too, in some 
embodiments of this invention, a choice has to be made between slightly under- 
estimating the loss for low values of delay, and over-estimating the loss for 
higher values of delay. In some embodiments, the curves are derived from 
experiments conducted in N. Kitawaki and K. Itoh, Pure Delay Effects on 
Speech Quality in Telecommunications, IEEE Journal of Selected Areas in 
Communications, Vol. 9, No.4, May 1991, for different tasks: 

• Task 1: Take turns reading random numbers aloud as quickly as possible 

• Task 2: Take turns verifying random numbers as quickly as possible 

• Task 4: Take turns verifying city names as quickly as possible 

• Task 6: Free conversation 

In some embodiments of this invention, the experiments that involve 
intense interaction, such as business calls or transaction-related calls are the 
most relevant. For these embodiments, the metric is optimized for tasks that 

11 
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resemble more Tasks 1 and 2 (from N. Kitawaki and K. Itoh, Pure Delay Effects 
on Speech Quality in Telecommunications, IEEE Journal of Selected Areas in 
Communications, Vol. 9, No.4, May 1991, then Tasks 4 and 6. In addition, for 
such embodiments, a round-trip time that is larger than 500ms (that is, one way 
5 delays larger than 250 ms) is not desirable, as it gives a sense of poor quality 
that is not well reflected by the low performance degradation obtained in N. 
Kitawaki and K. Itoh, Pure Delay Effects on Speech Quality in 
Telecommunications , IEEE Journal of Selected Areas in Communications, Vol. 
9, No.4, May 1991. Thus, for such embodiments, the chosen voice negative 

10 exponential curves fit closely the results obtained in N. Kitawaki and K. Itoh, 
Pure Delay Effects on Speech Quality in Telecommunications IEEE Journal of 
Selected Areas in Communications, Vol. 9, No.4, May 1991, for low values of 
delay (that is, for D smaller than 250 ms), and over-estimates the MOS 
degradation for values of D that are larger than 250 ms (that is, for round-trip 

1 5 times that are larger than 500 ms). Hence, one embodiment that corresponds to 
this embodiment uses the following metric: 

exp(-l . ID) < m vD < exp(-2.0D) 

with an average of 

20 

m V D = exp(-1.5D) 

That is, a v = 1 .5, p v = 23, yielding S v = 15. Also, using the bounds for a v and p v 
obtained above, the corresponding minimum and maximum values for 5 V 
25 become: 10<S V <23. 

Metric for video traffic 

In this section, we describe the derivation of one embodiment of a 
metric for video. In this embodiment, we use a model that is very similar to that 
used for voice. Indeed, it is assumed in this embodiment that the degradation of 
30 a voice conversation because of excessive delay is very similar to the 
degradation of a video communication. In applications such as video- 
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conferencing, the aural sense (the fact that one end is listening to the other end) 
is complemented by the visual sense (that is, the fact that one end also sees the 
other end). As a result, in some embodiments, the video metric can be 
considered slightly less sensitive to delay than the voice metric. 

5 As far as loss is concerned, in some embodiments, the voice metric 

underestimates the effect of loss in video quality; indeed, in such embodiments, 
the loss of one video frame can affect a large number of frames. The actual 
number of frames affected depends on the encoding of the video sequence. In 
this embodiment, we use the concept of useful frames to derive, from the metric 

10 obtained for voice, a metric that can be applied for video. A useful frame 
denotes a frame that is successfully decoded at the receiver. In some 
embodiments, the effect of loss on the metric can be increased by taking into 
account the average number of frames lost by the encoder upon the loss of a 
frame as video traffic traverses a path (that is, in some embodiments, as video 

15 traverses the network). In such an embodiment, the ratio of frame loss (that is, 

those frames that the receiver is unable to decode successfully) vs. packet loss is 
used to obtain a scale factor that can be applied to the loss component of the 
voice metric function, yielding the loss component of the video metric. In this 
embodiment, the following methodology is used to derive this ratio. First, a 

20 model for frame loss along the path is assumed. In this embodiment, it is 

assumed that all frames are affected by lost independently, e.g., the loss model 
is Bernoulli. Those skilled in the art can follow the same methodology and 
derive a similar metric for video for other loss models considered. (Specifically, 
in some embodiments of this invention, it is useful to consider a loss model that 

25 is clustered, following a model of loss that is attributed to the Internet.) The 

independent model can, in some embodiments, be applied to the Internet, since 
every frame is typically formed by more than one packet; that is, in such an 
embodiment, it is assumed that a cluster of packet loss occurs in a single frame, 
hence this independent assumption holds in the Internet. In some embodiment, 

30 it is assumed that a Group of Pictures (GOP) contains N frames, one I frame, 
and N-l P frames. If an I frame is lost, the rest of the N -I frames of the GOP 
are lost; if the first P frame is lost, the rest N - 2 frames of the GOP are affected, 
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and so on. Therefore, in some embodiments, the scaling factor can be computed 
as: 

SF vldeo = (1/7V) x [1 + 2 + . + (N- 1)] = (N - l)/2 
Hence, in some embodiments, the video model becomes: 

5 

metric vic ieo — exp(-£> - 69/) 

That is, a v = 1 .0, p v = 69, yielding 5 V = 69. Note that in this embodiment, the 
value of a v is smaller for video than for voice, to take into account the fact that 

10 video is less sensitive to delay. Also, in this embodiment, the value of p v is 

scaled by a factor SF v \d eo = 3, which corresponds to N = 7. Those skilled in the 
art can derive metrics that use different values of a v = 1 .0, p v = 69, depending 
on the set of assumptions used. Also, those skilled in the art can use the 
methodology described above to derive various metrics, based on different 

1 5 parameters, depending on the assumptions. 

Metric for TCP data traffic 

In this section, we describe an embodiment of this invention that applies 
for generic TCP traffic. We find the appropriate values for a d , p rf and hence 8 d 

20 in this context. In some embodiments of this invention, the metric is applied to 
short TCP connections. In other embodiments, the metric is adapted to the 
throughput of a typical file transfer, assumed to be 75 KBytes; in yet other 
embodiments, the metric is to be adapted to the throughput of an infinite file 
transfer. Some embodiments that cover two cases: (1) the latency of short 

25 transactions (e.g., a buy button on some web site) and (2) the throughput of files 
that have a "typical" size of 75KBytes. Those skilled in the art can use a similar 
methodology to derive the metric for files of other sizes for other embodiments. 

Optimizing b d to capture the increase in latency for a short connection 

In some embodiments, the performance metric used for short 
30 connections is the latency incurred from the instant the user asks for the 
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transaction to be sent, to the time the packet containing the transaction arrives at 
the destination. Hence, Id = Latency(D 0 )/Latency(D) and // = 
Latency(Q)/Latency{F) are the corresponding normalized metrics, where / 
represents the one-way packet loss rate on the path, D represents the one-way 
5 delay, and Do is the minimum assumed one-way delay RTTq/2 = 25 ms. In one 
embodiment, we approximate the latter two measures with negative 
exponentials, yielding to m<j D and mdi, respectively, i.e., 

mdD = exp(-otdD) « Latency(Do)/Latency(D) 
10 m<ji = exp^PdO « Latency(0)/Latency(/) 

Note that the equation for m d o seems to lead to the following discrepancy: 
mdD{Do) while Id(Dq) = 1. It appears that this discrepancy could have been 
avoided by using a normalized version of m d D, ^doiP) = exp[-a^(Z)-Z)o)] so that 

1 5 m dD {D$) = 1 . However, by doing so, we lose the similarity between the 

performance metrics for voice and data. Also, as will be seen in the following 
graphs, rrido ~ exp(-a^Z)) approximates Id(Do) quite well. Hence, in this 
embodiment, there is no incentive in normalizing m d D in this fashion. In other 
embodiments, normalizing ntdo could be warranted. 

20 We start with an approximation of l D , that involves the derivation of a single 
constant a^. This is done in Figure 6a. As can be seen from the figure, atc</d(25 
ms) * 1 , as expected. Approximating Id with m d D leads to an overestimation of 
the performance degradation for low one-way delays (that is, for D < 1 00 ms) 
and very large delays (that is, for D > 400 ms). However, the approximation is 

25 quite reasonable for D in the [0, 500 ms] range, that is, for RTTs ranging from 0 
to one second. The corresponding value of a d is 4.5, yielding to rridD = exp(- 
4.5ZJ). Using the value = 4.5 thus obtained, we now approximate // using 
rndD-nidi ~ exp(-a^Do - PdO; the result is shown in Figure 6b. the approximation 
seems reasonable in this case too, leading only to a slight underestimation for 

30 very low (that is, lower than 1%) and very large (that is, larger than 8%) loss 
rates. The resulting value of is 16, and the corresponding m d i becomes nidi — 
exp(-16/)« 
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Therefore, in some embodiments, the corresponding value of 8 d in this 
case is 16/4.5 = 3.6. That is, for such embodiments, the value of 8 is much 
lower for data traffic than for voice traffic (in which case 8 was, as shown above 
equal to 15). In fact, in one embodiment with TCP traffic, the relative 
importance of delay as compared to loss is much higher. In one embodiment 
with video traffic, 1% loss rate was equivalent to a 150 ms delay, whereas in 
one embodiment with TCP traffic, a 1% loss rate is only worth a 36 ms delay. 
Conversely, a 100 ms delay in one embodiment with voice traffic is equivalent 
to a mere 0.6% loss rate, whereas a 100 ms delay in one embodiment with TCP 
traffic is equivalent to a 2.78% loss rate. 

In Fig. 7, we show the approximations for a range of loss rates and one- 
way delays in order to understand the effectiveness of our approximations when 
both packet loss and delay come into play concurrently. Our approximation 
does surprisingly well in most of the ranges of interest for delay and loss rate, 
respectively. 

Optimizing 8 rf to capture the decrease in throughput for a connection of 
typical file size 

A similar procedure is used in this section to derive some embodiments, 
where the metric negative exponential metric captures the decrease in 
throughput for a connection of typical file size. In such embodiments, the 
performance metric of interest is the average achievable connection throughput. 
In such embodiments, t D = Throughput {D)/ Throughput (D 0 ) and U = 
Throughput {1)1 Throughput (0) are the appropriate normalized metrics, where as 
described above, / represents the one-way packet loss rate on the path, D 
represents the one-way delay, and D 0 is the minimum assumed one-way delay 
RTTo/2 = 25 ms. Assuming that we are able to approximate the latter two 
measures with negative exponentials, yielding to m dD and m dU respectively, we 
must hence have 

m dD = exp(-a d D) « Throughput(D)/ Throughput (D 0 ) 
m d ) = exp(-p d /) * Throughput {1)1 Throughput (0) 
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Note that, as in the previous sub-section, the equation for m dD seems to lead to 
the following discrepancy: m dD (D 0 ) *1 , while t D (D 0 ) = 1 . It appears that this 
discrepancy could have been avoided by using a normalized version ofm dD , 
5 m dD (D) = exp[-aXD-A))] so that m dD (D 0 ) = 1 . However, for the same reasons 
described in one embodiment with short TCP connections, some embodiments 
(described here) find no incentive in normalizing m dD in this fashion. (Other 
embodiments could, on the other hand, find it useful to normalize m dD in this 
fashion.). 

10 .In some embodiments, an approximation of t D can be found, which 

involves the derivation of a single constant a d . (See Fig. 8a.) As for short TCP 
connections, m dD (25 ms) * 1, as expected. Also, the approximation is quite 
reasonable for D in the [0, 500 ms] range, that is, for RTTs ranging from 0 to 
one second. The corresponding value ofa d is 6.5, yielding to m dD = exp(-6.5£>). 

1 5 Using the value of a d = 6.5 thus obtained, U can be approximated using m dD .m d i 
= exp(-oc</D 0 - PdO; the result is shown in Figure 8b. The approximation seems 
reasonable in this case too. The resulting value of is 37, and the 
corresponding m d i becomes m d i - exp(-37/). 

Therefore, for such embodiments, the corresponding value of 8 d is 

20 37/6.5 = 5.7. That is, when these assumptions hold, the value of 8 is 

significantly higher than for short TCP connections (3.6), but still lower lower 
than for voice traffic (10 minimum, 15 on average). In fact, in one embodiment 
with TCP traffic, the relative importance of delay as compared to loss decreases 
as the file transferred increases. In one embodiment with typical file transfers, 

25 1% loss rate is equivalent to a 57 ms delay, which is in between the 36 ms delay 
obtained in one embodiment with short TCP connections and the 150 ms delay 
obtained in one embodiment with voice traffic. Conversely, a 100 ms delay in 
one embodiment with typical TCP connections represents a 1 .75% loss rate, in 
between the low 0.6% loss rate obtained in one embodiment with voice and the 

30 large 2.78% loss rate obtained in one embodiment with short TCP connections. 
In order to understand the effectiveness of these approximations in such 
embodiments when both packet loss and delay come into play concurrently, we 
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show in Figure 9 the approximations for a range of loss rates and one-way 
delays. As for short TCP connections, our approximation does surprisingly well 
in most of the ranges of interest for delay and loss rate, respectively. 

Unified metric for TCP traffic 

5 In summary, the graphs above show that, in some embodiments, the 

appropriate values of 8 d for short connections and typical file transfers are 3.6 
and 5.7, respectively. In some embodiments, it is desirable that one value of 5^ 
be applied to a number of TCP applications; in some embodiments, 8 d can be 
set to the average of both values, 4.6. In other embodiments, it could be 

10 desirable to use different values for each. Using a similar methodology, those 
skilled in the art can derive embodiments with different parameters, and 
different values for these parameters. 

Unified metric for both voice and TCP data traffic 

Summarizing our results 

1 5 Table 1 below summarizes the results described in the previous two 

sections for some embodiments. Other embodiments for voice traffic can take 
into account the effect of clip length. 



Table 1 Summary of values for this embodiment 



Traffic Type 


a 


P 


5 


Voice 


1.5 


23 


15 


TCP 








Short TCP 
connections 


4.5 


16 


3.6 


Typical TCP 
transfers 


6.5 


37 


5.7 



20 In some embodiments, it is desirable to use one metric for both voice 

and tcp traffic; in at least one or more of such embodiments, the metric describe 
above with 8 = 10 can be used. 
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Deriving user-perceived performance measures for web applications 

In some embodiments, the metric measures the quality of specific 
applications used by humans, that use TCP for transport. For some of these 
embodiments, adequate performance metrics measure the model the subjective 
user-perceived quality of the application. Hence, in some embodiments, TCP 
performance can be mapped to objective application performance, which, in 
turn, can be mapped to user-perceived quality metrics. In some embodiments, 
the application of interest includes a web transaction (http). Other embodiments 
can focus on constructing a metric for other such applications, such as telnet, 
ftp, or other. In some embodiment, a mapping between the web transaction 
duration and the underlying latency of a TCP transaction can be derived and 
used. Using this model, the duration of a web transaction can be found as a 
function of the network performance metrics (e.g., delay, jitter, and loss). In 
turn, for some embodiments, the duration of a web transaction can then be 
mapped to an application score, which can take into account the subjective 
perception of the application quality by the user. The sequence of such 
mappings is shown in Figure 10. In this document, we provide the insights 
behind some embodiments, that involves the choice of models at each of the 
steps shown in the procedure. We start by describing the model used for TCP 
transactions that corresponds to some embodiments. We then go over the 
specifics of the HTTP model used (yielding the duration of a web transaction) 
in some embodiments. Finally, we explain how, for some embodiments, the 
duration of a web transaction is mapped to a user-perceived metric. 

TCP models 

TCP functions and mechanisms 

In the following, we describe the Transport Control Protocol (TCP) 
modeled in some embodiments. TCP is a window-based protocol which 
principal function is to provide a reliable transport between two end-points. It 
starts with a protocol handshake, followed by the transmission of packets using 
a sliding-window algorithm, according to which the sending window advances 
as acknowledgments are received. A simple TCP transfer is shown in Fig. 1 1 . 
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TCP is also provided with mechanisms that render its utilization of resources 
(that is, bandwidth and buffer) in the network adaptive, depending on the 
conditions in the network (e.g., loading, congestion, etc.). These mechanisms 
are slow start, congestion avoidance, fast retransmit, and fast recovery. In the 
5 following, we briefly describe each of these phases. More details can be found 
both in the original congestion avoidance paper by Jacobson V. Jacobson, 
Congestion Avoidance and Control, SIGCOMM' 88. and in the invaluable book 
by Stevens W. R. Stevens, TCP/IP Illustrated, Volume I: The Protocols, 
Addison- Wesley, 1996. 

10 

Protocol handshake 

A TCP connection is preceded by a three-way handshake: the sender first 
transmits a SYN, to which the receiver responds with a SYN/ACK. The sender 
finally replies with an ACK upon the receipt of the SYN/ACK. (See Figure 1 1 .) 

15 

Slow start 

It should be clear that the bandwidth used by a TCP connection increases with 
the window size used by the sender. Since it may not be known originally how 
much bandwidth is available for the transfer, a start is selecting a small window 
20 size (typically equal to one segment). After that, it increases the window size by 
one for each acknowledgment it receives, until one of the following events 
occurs: 

1 . Either a maximum window size is reached, which is the minimum 
among the default maximum window size used by TCP, typically 64 

25 KB, and the receiver socket buffer size (which can be set by the 

application, and which default varies depending on the operating 
system). 

2. Or the rate of packets sent is larger than the available bandwidth, leading 
to a packet loss. In this case, the window is set back to 1 . A packet loss 

30 is detected using one of two ways: either the TCP retransmit timer 

expires, or three duplicate acknowledgments for the same sequence 
number of received. More details about the detection of loss can be 
found in the fast retransmit and fast recovery section. 
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In both cases, a new phase is entered, one in which a combination of slow start 
and congestion avoidance are performed, as described below. 

5 Congestion avoidance 

After either the window size reaches its maximum, or the loss of a packet 
occurs, the TCP transfer enters the congestion avoidance phase. In this 
embodiment, TCP is assumed to transmit at a given window size cwnd = W {ihc 
sender window size is called the congestion window size, cwnd), and that a loss 

1 0 event occurs. At this point, half the value of the current window W is saved into 
a variable called the slow start threshold {ssthresh = W/2), and cwnd is reset to 
1 . Thereafter, TCP enters the slow start phase, until cwnd reaches ssthresh, at 
which point TCP enters the congestion avoidance phase. During this phase, and 
as long as no packet loss is detected, cwnd is only increased by \ I cwnd at each 

1 5 ACK received, leading to an approximately linear increase in the window size 
with time. 

The rationale behind this behavior is as follows: the original window size was 
clearly too large, since it resulted in a packet loss. Hence, a lower window size 
must be used, and this window size is most probably somewhere between W/2 

20 and W. Accordingly, slow-start allows cwnd to reach W/2 very quickly (i.e., 
exponentially); at this point, a bandwidth discovery process is initiated, in 
which TCP attempts, in a greedy manner, to obtain as much bandwidth as it can. 
In case a packet loss occurs again at any step of the process, cwnd is reset to 1 
again, and the process is restarted again (i.e., slow start followed by congestion 

25 avoidance). 

Fast retransmit and fast recovery 

In our discussion of congestion avoidance, we did not differentiate between 
types of packet loss. However, the TCP sender detects packet loss using one of 
30 two methods: 

1 . Timeout: Once TCP sends a packet, it starts a timer RTO. If the ACK for 
that packet is not received at the expiry of RTO, the packet is recorded 
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as lost. RTO is computed in one the following ways for some 
embodiments: 

• During handshake, the time-out is initialized to a large value, for 
example either 3 seconds in some embodiments, 5.8 in some 
embodiments. 

• Once Round-trip time measurements are obtained, then RTO is 
obtained using the following formula in some embodiments: 

RTO = A + 4D, 

Where A is a moving average of the round-trip time, while D is a 
moving average of the mean deviation of the round trip time, i.e. the quantity \M 
-A\, where M denotes the latest round-trip time measurement obtained at the 
sender side. More specifically, for each packet received, A and D are obtained 
as follows (even thought and D are typically only updated once every 500 ms). 

A <^-(l -g)A+gM 
D^D + g{\M-A\-D) 

Where g is a small gain factor (which, in some embodiments is set to a 
negative power of two). 
2. Triple duplicate ACKs: in case a packet is lost, the receiver will generate 
for each following segment received an ACK for the same sequence 
number, which corresponds to that of the lost packet. Hence, in case the 
sender receivers a multitude of ACKs for the same sequence number, it 
can assume that the corresponding packet was lost. In TCP, three 
duplicate ACKs (that is, four ACKs for the same sequence number) 
signal a packet loss. Only in this case do the Fast Retransmit and Fast 
Recovery algorithms kick in for some embodiments. 
Hence, once a triple duplicate ACK is detected, the sender enters two processes: 
it first retransmits the segment it believes is lost. This is called a Fast 
Retransmit (since TCP doesn't wait for a time-out to occur before 
retransmitting). Then, as in congestion avoidance, ssthresh is set to half the 
current window. However, TCP sets cwnd to ssthresh in this case, hence 
avoiding the slow-start phase. 
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Typical (and simplified) traces of the TCP window size in time are shown in 
Fig. 12 for two cases: in one case, transmitting at the maximum window seldom 
saturates the link. This can be the high bandwidth case; effective bandwidth can 
be translated to a scale that is consistent to that of window. That is, we can 
5 actually be showing the product of effective bandwidth and round-trip time. 
Round-trip time can assumed to be constant in the figure (if not, then the 
window size increase may not have been linear with time during the congestion 
avoidance period)). In the other case, the appropriate ideal window to be used 
is significantly lower than the maximum window size chosen by the source. 
10 Clearly, the difference in TCP behavior between the two cases is very 

significant. In particular, the only way the window size is limited in one 
embodiment with the low bandwidth path is through packet loss. Combined 
with TCP's greediness in its attempt to capture more bandwidth from the link, 
this fact leads to a systematic pattern of packet loss. 

1 5 TCP performance 

The description of TCP's mechanisms above is helpful in understanding 
how a performance model for TCP is derived. Here, the model for TCP latency 
derived in N. Cardwell, S. Savage, and T. Anderson, Modeling TCP Latency, 
IEEE INFOCOM 2000, April 2000, is reviewed, as we believe it to be the 
20 appropriate in capturing some embodiments owing to its accurate modeling of 
the protocol handshake and slow-start. The equation below summarizes the 
findings of N. Cardwell, S. Savage, and T. Anderson, Modeling TCP Latency, 
IEEE INFOCOM 2000, April 2000. 
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Latency of a TCP Transaction = RTT + 2TA 
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where 

To - T s = 3 seconds 

RTO = RTT + 4 x mdev(RTT) 

y= 1 +(l/b) 

E[d ss ] = ((l-(l-p) d )(l-p))/p 

E[w ss ] = (E[d ss ](y- l)/y) + (wi/y) 

Q(p,w) = min (1, (1 + (1 - p) 3 (1 - (1 - p) W " 3 )) / ((1 - (1 - P) W V(1 - (1 - P) 3 ))) 

G(p) = 1 + p + 2p 2 + 4p 3 + 8p 4 + 16p 5 + 32p 6 

W(p) = ((2 + b)/(3b)) + Sqrt(((8(l - p))/(3 b p)) + ((2 + b)/(3b)) 2 ) 

D axk ' 100 ms 

As shown in the equation above, the latency of a TCP connection L 
depends on the file size to be transferred d (in number of segments), the packet 
round-trip time RTT, the packet RTT jitter mdev(RTT), and the packet loss rate 
p. L is constituted of various components, which are, in order, the latency 
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associated with the handshake, the slow-start, the first loss, and the transfer of 
the remaining bits. We describe the specifics of each and then include 
comments pertaining to the overall equation. 
Protocol handshake 

5 This equation states that the average time needed to complete the protocol 
handshake is one round trip time, to which a timeout is added to each lost 
packet in the transaction. Note that at this stage, the timeout is typically as large 
as 3 seconds for some embodiements, so even a small loss probability can be 
significant. 

10 

Slow start 

As described above, slow start lasts from the end of the handshake up to the 
time the window size ceases to increase (whether because a packet was lost, or 
because the window has reached its maximum value). In the context of N. 

15 Cardwell, S. Savage, and T. Anderson, Modeling TCP Latency, IEEE 

INFOCOM 2000, April 2000, in an attempt to make the math simpler, slow start 
is given a slightly different meaning. In fact, slow start is defined as the time 
period separating the end of the handshake up to the first loss, independently of 
whether the maximum window size W max is reached. Given the packet loss p 

20 and the dynamics of the window size increase during slow start, both the 

expected number of segments sent during slow start E[d ss ] and the window size 
reached at the end of the slow start process E[JV SS ] are computed. Clearly, as 
shown in the equation, in case E[W ss ]>W max , then the latency calculation is 
divided into two periods, the first capturing the exponential window growth, 

25 and the second capturing the period in which the window is constant, equal to 
W max . It is interesting to note that in case packet loss is null, then slow start 
according to this definition lasts for the entire length of the connection. 
Conversely, in case E[W ss \<W max , then the latency associated with slow start 
only includes a single component that corresponds to the exponential increase in 

30 window size. 
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First loss 

As described above, a packet loss is detected within some time period 
that translates directly into additional delay incurred by the user. This delay 
depends on whether the packet was detected through a time-out or a triple 
duplicate. In case of a time-out, then the delay is equal to RTO, which in this 
case is set to the initial 3 seconds in some embodiments. In case of a triple 
duplicate ACK ? the extra delay is as low as one round-trip time. In the equation 
above, the probability that packet loss occurs because of a timeout Q(p, w) is 
computed, given the loss rate p and the instantaneous window size w. The 
latency associated with the first loss can hence simply be computed as a 
weighted sum of RTO = T Q and RTT. Since a loss event can result in more than 
one packet loss, successive time-outs can potentially occur; the factor G(p)/(l- 
p) takes into account this fact by appropriately scaling the first time-out value 
T 0 . 

Transfer of remaining bits 

The following component of the TCP latency equation corresponds to 
the latency incurred by the transfer of the bits that remain after this first loss. 
Although complex, we can see that the formula basically models the effect of 
loss (whether detected through timeout or the receipt of triple duplicate ACKs), 
and that of congestion avoidance (as witnessed by the computation of W(p), 
which represents the average window size given the loss rate and the dynamics 
of congestion avoidance). Hence, this component of the equation is basically the 
ratio of the remaining bits to be transferred (d- E[d ss ]) to the average rate of 
transfer, in turn equal to one window per time period set to the sum of RTT and 
whatever additional delay is caused by congestion avoidance. What this 
component does NOT model, however, is slow start after retransmission of 
timeouts. This means that the equation above assumes that TCP is more 
aggressive than it actually is, which results to an under-estimation of the actual 
latency involved in the transaction. However, N. Cardwell, S. Savage, and T. 
Anderson, Modeling TCP Latency, IEEE INFOCOM 2000, April 2000, explains 
that the effect of this negligence should be small. 
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Delayed ACKs 

The final component in the equation above models (in a trivial way) the 
extra delay caused by delayed ACKs for some embodiments. In fact, 100 ms 
was measured as a good average delay increase caused by delayed ACKs in 
some embodiments. ACKs are not always sent back to the sender immediately 
after a segment is received at the destination. In fact, the functionality of TCP is 
as follows for some embodiments V. Paxson, Automated Packet Trace Analysis 
of TCP implementations, SIGCOMM' 97: two packets have to be replied to 
immediately. However, the receipt of an individual packet is only replied to at 
the end of a clocking timer, typically set to 200 ms in some embodiments. 
Hence, 100 ms seems like a good average value to be added to take into account 
the effect of duplicate ACKs for some embodiments, 
(e.g., see M. Allman, A Web Server 's View of the Transport Layer, ACM 
Computer Communication Review, Vol. 30, No. 5, October 2000. 

Comments about the model 

The model in M. Mathis, J. Semke, and J. Mahdavi, The Macroscopic 
Behavior of the TCP Congestion Avoidance Algorithm, ACM Computer 
Communication Review, July 1997, is indeed simpler, but also captures much 
less of the richness inherent to the behavior of TCP. As shown above, the 
model captures, in an intuitive way, a lot about TCP's behavior. In this section, 
we go over straightforward and useful comments on the model. 

1 . Latency does not depend on the link bandwidth. In fact, this model does 
not directly model the link bandwidth, but considers it through its effect 
on delay, jitter and loss. For example, if bandwidth is too low, then the 
loss rate increases (as is clear from Fig. 1 1). At the same time, round-trip 
time increases through increased queue sizes at the bottleneck link TCP 
time-outs. Note that for transactions that involve small file sizes, the 
latency is mostly affected by the first two components, i.e. the protocol 
handshake and slow start. Hence, the throughput of the transaction is 
probably unaffected by the bandwidth of the bottleneck link (that is, 
other than the effect of the later on loss rate or RTT). 
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2. Latency is in all cases a linear function of the round-trip time. It is true 
that this is an artifact of the above assumption (since the only non- 
linearity would have come from the effect of bandwidth on the latency 
of the TCP connection). However, since the effect of bandwidth alone is 

5 very small, the linearity of the TCP latency with the round-trip time 

holds in most cases in real systems. 

3. Contrary to round-trip time, latency is a non-linear function of loss 
probability. In fact, for loss rates above 5%, the increase in latency with 
the loss rate accelerates significantly. 

10 In some embodiments, the model described above can be used to model 

TCP behavior in one or more of the steps. 

HTTP 

In some embodiments, the metric measures web transactions. In this 
section, we describe http mechanisms used in some embodiments. In some 

1 5 embodiments, such transactions use the hypertext transfer protocol (HTTP), 
which, in some embodiments defines how TCP is used for the transfer of the 
different components of a web page (that is, for example, the html page, the 
different images, etc). In some embodiments, the first version of HTTP, 
HTTP/1.0, described in RFC 1945 T. Berners-Lee, R. Fielding, and H. Frystyk, 

20 Hypertext Transfer Protocol - HTTP/1. 0, IETF Request for Comment RFC 
1945, May 1996, encourages the following practices: 

1 . Different components of a given web page (e.g., the html text, and each 
of the different objects) are transferred using distinct TCP connections. 

2. In order to increase the speed of the transaction, web browsers are 

25 allowed to open more than one TCP connection at a time. (The typical 

number of parallel concurrent TCP connections is 4 in some 
embodiments.) 

In some embodiments, this approach is adequate for relatively smaller 
web pages, which include a few objects, when traffic on paths on the Internet is 
30 relatively limited. But as the Internet has become more and more ubiquitous, the 
disadvantages of this approach can become, in some embodiments more and 
more apparent: 
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1. On one hand, in some embodiments, an independent TCP connection 
goes through both the handshake and slow-start mechanisms. Since, in 
some embodiments, individual objects are relatively small, so TCP 
connections spend most of their time in these stages, rather than in the 
congestion avoidance stage. In some embodiments, this can be 
inefficient. 

2. On the other hand, it has been shown that in some embodiments, 
opening a multitude of TCP connections simultaneously increases the 
greediness and aggressiveness of the web browser's behavior, H. F. 
Nielsen, J. Gettys, A. Baird-Smith, E. Prud'ommeaux, H.W. Lie, and C. 
Lilley, Network Performance Effects of HTTP/ LI y CSSI, and PNG, 
SIGCOMM' 97, H. Balakrishnan, V. Padmanabhan, S. Seshan, M. 
Steem, and R. Katz, TCP Behavior of a Busy Internet Server: Analysis 
and Improvments, IEEE Infocom 1998, S. Floyd and K. Fall, Promoting 
the Use of End-to-End Congestion Control in the Internet, IEEE/ACM 
Transactions on Networking, 7(6), August 1999. To understand this, we 
provide the following simple example (from S. Floyd and K. Fall, 
Promoting the Use of End-to-End Congestion Control in the Internet, 
IEEE/ACM Transactions on Networking, 7(6), August 1999). Say a data 
transfer is divided among N parallel, concurrent TCP transactions. 
Assume a packet is lost in one of the connections. If all the data were to 
be transferred using one single TCP connection, the lost packet would 
lead to the halving of the window size, i.e. to the halving of the 
connection throughput. Instead, when N concurrent TCP connections are 
used, the lost packet will only halve the window size of one of the N 
connections, leading to a reduction of the aggregate throughput by a 
mere \I2N\ That is, the congestion algorithm that TCP is intended to 
perform is skewed towards a much larger greediness and aggressiveness, 
leading to an increase in congestion that can in turn bear a significant 
degradation in the performance of all streams involved. 

In some embodiments, the first problem can be solved through the setting of the 
Keep-Alive option T. Berners-Lee, R. Fielding, and H. Frystyk, Hypertext 
Transfer Protocol - HTTP/ 1.0, IETF Request for Comment RFC 1945, May 
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1996. In some embodiments, an open TCP connection is not closed after the 
object transfer is completed, but rather used for the transfer of the next object. 
This way, in some embodiments, the number of concurrent TCP connections 
remains equal to four. In some embodiments, the latency of the total web 
5 transfer can be improved by reducing the number of mandatory protocol 

handshakes and slow-starts. However, in some embodiments, the Keep-Alive 
option does not solve the second problem. In fact, in such embodiments, setting 
the Keep-Alive option only exacerbates the second problem, and renders the 
behavior of the web browser even more aggressive. 
1 0 In light of such findings, a revision of HTTP (HTTP/ 1 . 1 , R. Fielding, J. 

Gettys, J. Mogul, H. Frystyk, and T. Berners-Lee, Hypertext Transfer 
Protocol - HTTP/1. 7, IETF Request for Comment RFC 2068, January 1997) 
included new ways to be used by the web-browser for the transfer of a web 
page using TCP. HTTP/1 .1 advocates the use of a single TCP connection for 
1 5 the transfer of the entire web page. Such a connection is called persistent. 

This way, the second problem described above is solved and TCP recovers 
its original aggressiveness. In addition, the client can pipeline requests for 
different objects belonging to the same web page on the same TCP 
connection. This way, the transfer of the corresponding objects is pipelined, 
20 which has the desirable effect of reducing the variability between the delays 

associated to the download of each of the different objects. 
Assume a web page includes the combination of text and 9 images. In Figs. 
13-14, the different ways to transfer a web page in some embodiments are 
illustrated, using some of the various HTTP/ 1.0 and HTTP/ 1.1 methods 
25 described above. 

In some embodiments, assuming infinite bandwidth and zero loss probability, 
HTTP/ 1.0, with the Keep-Alive option set basically divides the time it takes to 

download the images by a factor of 4. However, in some embodiments, 

< 

assuming the realistic case where loss rate is positive, the actual duration of the 
30 web transaction can be assumed to be the maximum among the latency of the 
individual TCP connections; that is, in some embodiments, the duration of the 
web transaction D w can be estimated as the sum of the TCP transfer of the html 
file, L htm u followed by the maximum among the 4 TCP connections, L max = 
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max(Z,i, L 2 , £3, L 4 ). In some embodiments, finding the exact maximum among 
various TCP connections is a complex process, as it implies knowledge of the 
latency distribution, whereas the equation described above only derives the 
average latency. In some embodiments, L max can be approximated, based on the 
5 following assumption: assuming that only one connection among the four 

connections incurs a packet loss, then L max may simply approximated to be the 
duration of that particular transaction. Assuming that the loss probability on the 
link is equal to p 9 each connection then may be assumed to incur a loss rate p, so 
that the probability of no loss by either connection can be approximated to P nl = 

10 1-P = (1-p) 4 - That is, in some embodiments, the probability that either 

connection experiences a loss becomes equal to P - l-(l-p) 4 which, in some 
embodiments, can be approximated to 4p for small p. That is, in some 
embodiments, an approximate value of L max can be obtained using the equation 
for average latency, and in some embodiments, the loss rate can be set to 4p. 

1 5 Other embodiments may use different approaches to yield various models. 

Description of models used 

In the following, we describe different embodiments of models that can be 
derived for both HTTP/1.0 and HTTP/1.1 downloads. In some embodiments, it 
can be assumed that what is downloaded is one or more plurality of objects 
20 from what is assumed to be a "typical" web site. Other embodiments of this 

invention can derive these results for other HTTP embodiments , and other web 
sites. 

In some embodiments, a typical web transaction can be modeled using the 
following: 

25 LA one-segment request for the web page to be transfer. 

2. ... followed by a transfer of a file of size / In some 

embodiments, the actual latency formula representing the 
transfer depends on whether HTTP/1 .0 or HTTP/1 . 1 are used. 
In some embodiments, we can assume that the path is characterized by a loss 
30 rate /?, a round-trip time RTT, and a round-trip time jitter J. 

In the case of HTTP/1 .0, some embodiments assume that the Keep-Alive option 
is set. Also, some embodiments assume that half the file constitutes the actual 
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html ASCII text, whereas the other half includes the different images in the file. 
While, in some embodiments, the ASCII is downloaded using a single TCP 
connection, the information pertaining to the images is downloaded, in parallel 
over 4 independent TCP connections. Denoting the latency of a TCP transfer of 
a file of size f s , over a path characterized by a loss rate p, a round-trip time RTT 
and a round-trip time jitter J by L(f S9 p, RTT, J), then the total duration of the 
web transaction can, in some embodiments, be approximated to 

Amp/1.0 = Z(1500Bytes, A RTT, J) + LtfJ2 9 p 9 RTT, J) + LtfJS, 4p, RTT, J) 

Those skilled in the art can start with a different set of assumptions, follow a 
similar methodology, and derive various other models that derive the total 
duration of an HTTP/ 1.0 web transaction in function of the various dynamic 
parameters of the path. In the case of HTTP/1.1, some embodiments assume the 
use of persistent, pipelined transactions. Since, in some embodiments, a single 
TCP transaction is used for the download of the entire file, the total duration of 
the web transaction can, in some embodiments, be approximated to 

Arrnvi.i = £(1500Bytes,/>, RTT, J) + L(f s , p, RTT, J). 

Those skilled in the art can start with a different set of assumptions, follow a 
similar methodology, and derive various other models that derive the total 
duration of an HTTP/ 1.1 web transaction in function of the various dynamic 
parameters of the path. 

Some web transaction duration results 

In this section, we show a set of web transaction duration (denoted D w ) 
results for some embodiments of this invention, as a function of both round-trip 
time, loss rate and jitter. (See Figs. 15-17.) In the context of these examples, the 
graphs are shown as a function of one-way delay and jitter, simply obtained by 
halving the round-trip time and round-trip time jitter, respectively. Those skilled 
in the art can derive examples, where the parameters in the graphs constitute 
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round-trip parameters. Since, in some embodiments, TCP latency as shown in 
the previous section may be linear with round-trip time, the linearity of web 
transaction duration with one-way delay is expected as a result. (See Figure 15.) 
Also, it can be seen from Figure 1 6 that for some embodiments, the curve 
5 showing the dependence of D, v on p may be convex, that is, its slope increases 
with increasing values of p. Since, in some embodiments, HTTP/1.0 depends on 
4p, the increase of D w with p may become increasingly significant as p exceeds 
5%. Conversely, in some embodiments, loss rate does not seem to affect much 
the duration of an HTTP/1.1 web transaction. Finally, Figure 17 reveals that in 
10 some embodiments, the dependence of D w on jitter is very small for both 
HTTP/1 .0 and HTTP/1 . 1 , irrespectively of the file size; therefore, in some 
embodiments, the jitter parameter may be ignored. 

metric versus web transaction duration 

In this section, some embodiments of the remaining task of mapping 

15 transaction duration to a measure of quality that captures the user's perception 
are described. The embodiments described rely in part on relevant papers on the 
subject, A. Bouch, N. Bhatti, and A. J. Kuchinsky, Quality is in the eye of the 
beholder: Meeting users' requirements for Internet Quality of Service, To be 
presented at CHF2000. The Hague, The Netherlands, April 1-6, 2000, pp. 297- 

20 304.and A. Bouch, N. Bhatti, and A. J. Kuchinsky, Integrating User-Perceived 
Quality into Web Server Design, HP Labs Technical Report HPL-2000-3 
20000121, which present experiments designed to estimate users' perception of 
quality in the context of web transactions. (The paper stresses, in particular on 
e-commerce.) In the papers, the authors have created a web site with latency 

25 programmability. (That is, they control the latency of the transfer of individual 
pages.) Users are asked to go through the different pages of this site, and rate 
the latency obtained as low, average of high. The final result is the graph shown 
in Fig. 18, that shows the percentage of users responding "low", "average" and 
"high" versus the actual duration of the web transaction. 

30 The results shown in Fig. 1 8 can be translatedvto some subjective 

measure of user-perceived quality, which we denote by "synthetic Mean 
Opinion Score" (MOS), from which the metric will be derived. In this respect, 
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in some embodiments, a graph presented in N. Kitawaki and K. Itoh, Pure 
Delay Effects on Speech Quality in Telecommunications, IEEE Journal of 
Selected Areas in Communications, Vol. 9, No.4, May 1991, can be used, that 
maps MOS values to an "impermissible rate", that is, to the percentage of users 
that think the quality is unacceptable. This graph is shown in Figure 19. For 
some embodiments, A MOS versus web transaction duration function can be 
obtained by assuming that the "high latency" rate in Figure 18 can be 
interpreted as an "impermissible rate". In some embodiments, this assumption is 
reasonable, since, as far as the content provider is concerned, high latency is 
unacceptable and should never be experienced by the user. This may be thought 
to be true, especially for high value transactions, such as trading, or even 
shopping. Those skilled in the art can follow the methodology described in this 
document to use other studies of user-perceived quality and derive 
corresponding metrics. 

The resulting MOS versus latency curve can be obtained for some 
embodiments. Interestingly, the curve shows that for these embodiments, the 
MOS does not degrade smoothly as transaction duration increases. In fact, in 
these embodiments a sharp decrease from a MOS of 5 to a MOS of 4 occurs 
when the transaction duration exceeds the two seconds mark. In one 
embodiment, the MOS ratings from 1 to 5 bear the following interpretations: 5 
for Excellent, 4 for Good, 3 for Fair, 2 for Poor and 1 for Bad. Other 
embodiments use different values. Similarly, in some embodiments, a similar 
behavior occurs around the 8 seconds mark. 

Applying the MOS versus latency curve to the latency results: some MOS 
results. 

In some embodiments, deriving the metric from the MOS value can 
comprise a step where the 1 to 5 MOS scale is normalized to a scale ranging 
from 0 to 1 . In Figs. 20-22, we present the resulting metric scores corresponding 
to the web transaction duration results shown in Figs. 15-17, for some 
embodiments of this invention. 
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In some embodiments, such MOS functions can be used to derive 
metrics both for HTTP/1 .0 and HTTP/1 . 1 . For some embodiments, the 
following conclusions can be drawn, as observed from the metric graphs above: 

• For some embodiments, the effect of jitter is negligible. 

• In some embodiments, the effect of loss rate is not as large as one would 
expect. In fact, in some embodiments, TCP latency increases 
considerably as the loss rate increases from 2% to 10%. However, in 
some embodiments, what is important is the user perception. For 
example, even though an increase of latency from 1 00ms to 2 seconds 
represents a 20-fold increase, this increase can, in some embodiments 
have a negligible effect of quality. 

• In some embodiments, The results also show that the file size affects the 
metric significantly, which, in these embodiments, is expected. In some 
embodiments, if the focus is on web transfers of the transactional typq, 
then file sizes of up to 10 KBytes may be considered. 

• In some embodiments, the version of HTTP used has a surprisingly high 
effect on the results. In some embodiments, this is especially the case 
when loss rate is high (that is, larger than 5%). In some embodiments, 
the reason for this behavior is, again, the fact that the effective loss rate 
incurred in one embodiment with HTTP/1 .0 is four times that incurred 
with HTTP/1. 1. 

An additive embodiment of HTTP/1.0 and HTTP/1 A metrics 

In some embodiments of this invention, it is desirable to derive metrics 

for HTTP/1.0 and HTTP/1.1 that are additive (in the sense described above). In 

this section, we describe such embodiments. 

The metrics derived here match for some embodiments, those shown 

graphically in the previous section for a wide range of the elementary dynamic 

parameters of delay, jitter, and loss. Those skilled in the art can use the same 

methodology to derive similar metrics with different parameters. 

Let a, b, c, and d be parameters that can be tailored to the particular application. 

The embodiments described here correspond to HTTP/1.0 and HTTP/1.1, 

respectively: 
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For HTTP/1. 0, a = 1.18, b =0.13, c =0.15, and d =0.25; 
Fpr HTTP/1.1, a =1.30, b -0.31, c =0.41, and J =1.00. 



Let 

x = -(20.0/9.0)*(6 - c); 
j/ = (1.0/9.0)*(10.0*c-6); 

Let pmax = x x delay + y (where delay is the delay in seconds). 
Let loss be a measure of loss on the path. 

if (loss < p mwi Id), then metricLoss = 1.0 - log(l .0- loss/d)llog(l.O- p max Id) 
if (loss > p max Id), metricLoss = 0.0; 

metricDelay =1.0- (delay)! a 

Let metric denote the metric derived for the application; then for this 
embodiment, the value of metric is obtained as follows: 

If (metricLoss + mosDelay > 1.0), metric — metricLoss + metricDelay - 1.0 
If (metricLoss + mosDelay < 1 .0), metric = 0.0. 

The metric value shown in these embodiments is clearly additive, in the sense 
described above. Those skilled in the art can derive additive metrics for other 
applications using different parameters, or different values of these parameters, 
following the methodology described above. 

In some embodiments, the metric can, in addition to being performance related 
(as described above), can include non-performance related characteristics; in 
some embodiments, the non-performance related can include pre-specified route 
preferences. 
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Apparatus for Characterizing the Quality of a Network Path 

In this final section, we describe some embodiments of network devices 
that characterize the quality of network paths and use these in the process of 
routing. Effectively, such a network device configured, such that if the network 
device is connected to at least a network path including a first segment and a 
second segment, the network device performs: 
1 . accessing a first metric and a second metric, 

wherein the first metric and the second metric are at least in part 
quality characterizations of a same plurality of one or more 
network applications, wherein the quality characterization 
characterizes a quality of the same plurality of one or more 
network applications running at one or more segment end-points 
wherein the first metric and the second metric are at least partly a 
function of a same plurality of one or more elementary network 
parameters, wherein the plurality of one or more network 
parameters include one or more of delay, jitter, loss, currently 
available bandwidth, and intrinsic bandwidth 
wherein the first metric is at least partly the function of the same 
plurality of elementary network parameters of the first segment, 
wherein the one or more segment end points include one or more 
end-points of the first segment 

wherein the second metric is at least partly the function of the 
same pluralit of elementary network parameters of the second 
segment, wherein the one or more segment end points include 
one or more end-points of the second segment 
2. adding the first metric and the second metric to generate a third metric, 
wherein 

the third metric is at least partly the function of the same 
plurality of one or more elementary network parameters of the 
network path, wherein the one or more segment end points 
include one or more end-points of the network path 
the third metric is a quality characterization of the same plurality 
of one or more applications 
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In some embodiments, the network device further performs: 
prior to accessing the first or the second metric, generating at least one of the 
first metric and the second metric 

In some embodiments, the network device further performs: 

prior to accessing the first or the second metric, receiving at least one of the first 

metric and the second metric 

In some embodiments, the network device deals with a plurality of one 
or more network parameters that is dynamic. In other embodiments, the network 
device is such that at least one of the plurality of one or more network 
parameters is static. 

As described in the method sections, the plurality of one or more 
network applications include at least one of UDP and TCP applications. UDP 
applications include voice, video, whereas video applications include video 
conferencing. In some embodiments, TCP applications include HTTP, whereas 
HTTP applications include HTTP/1.0 and HTTP/1.1. In some embodiments, 
TCP applications include ftp and telnet. 

In some embodiments, the network device is such that the plurality of one or 
more network parameters include delay, jitter, loss, currently available 
bandwidth and intrinsic bandwidth. 

In some embodiments, the metric can, in addition to being performance 
related (as described above), can include non-performance related 
characteristics; in some embodiments, the non-performance related can include 
pre-specified route preferences. 

In some embodiments, the network device further comprises: 

a plurality of one or more inputs adapted to be coupled to the 
network path, 

a plurality of one or more outputs coupled to the plurality of one 
or more inputs, wherein, responsive to a plurality of one or more 
packets arriving to the network device through the plurality of 
one or more inputs, the network device selects at least one output 
from the plurality of one or more outputs, wherein the at least 
one output is determined at least partly using at least one of the 
first metric, second metric, and third metric. 
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Measurement Packets 

A measurement packet is a packet sent by a sender over an internetwork 
that includes information necessary for the receiver of the packet to compute 
measurements of the performance characteristics of the path the packet has 
traversed over that internetwork. The information includes information for a 
receiver of the measurement packet to compute measurements of performance 
characteristics of at least a portion of the path of the measurement packet; and 
data including one or more of measurement statistics, a generic communication 
channel, network information, and control data directing a receiver of the 
measurement packet to change one or more configuration parameters of the 
receiver. 

In some embodiments of the invention, the information included in the 
measurement packet to compute measurements includes at least one of a 
timestamp of a sending time of the packet and a number to identify the packet 
by itself and/ to identify the relative position of the measurement packet in a 
sequence of measurement packets, 

In some embodiments of the invention, the measurement packet is 
implemented using the following data structure: 

struct MeasurementHeader { 
/ * * 

* A generation number. This value represents when the 

* sender began sending. This value is a standard Unix 

* timestamp that seconds since Jan 1, 1970 UTC . 

* * J 

uint32_t mGeneration; 
/ -k * 

* A sequence number for the packet. This increments each 
* time a packet is sent and rolls over when 16 bits is 

* exceeded. 

* ★ J 

uintl6_t mSequence; 
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/ * •*• 

* The IP address the packet is sent to. 

* * / 

uint32_t mDstAddr; 

5 

/ * -k 

* The send timestamp for this packet. 
uint64_t mSendTime; 

10 >; 

The mGeneration field is used to detect when a sending process has 
started a new session. This field is used by the receiver to determine that a 
discontinuity in the stream's sequence numbers is the result of a sender restart, 

1 5 rather than due to large network latencies, duplicate packets or dropped packets. 

The sequence number mSequence field is incremented by one each 
time a packet is sent. This approach allows the receiver to deduce lost and 
duplicate packets by identifying missing and duplicate sequence numbers. 

The mSendTime field contains the time at which the packet was sent, 

20 represented as microseconds since January 1, 1970 UTC. This field is 

compared to the time the packet arrived at the receiver to determine the delay 
between the sender and the receiver. 

In some embodiments of the invention, a plurality of one or more 
packets are sent over a path continuously. In some embodiments of the 

25 invention, the continuous stream of packet is denoted as a measurement stream. 
Each measurement stream is uniquely identified by the source and destination 
IP addresses. The sender maintains one socket descriptor for each source IP 
address it sends from and writes the destination IP address into the mDstAddr 
field. On the receiver side, the source IP address is returned by the recv() 

30 system call and the destination address is retrieved from the measurement 
packet. 



35 



40 



Data Included in the Measurement Packets 

In measurement packets that contain sufficient space, data will be 
included, including one or more of measurement statistics, a generic 
communication channel, network information, and control data directing a 
5 receiver of the measurement packet to change one or more configuration 
parameters of the receiver. 

Some embodiments of the invention will add a single type of data to 
each packet. Some embodiments of the invention will use a complex data, 
including subpackets. 

1 0 Some embodiments of the invention use subpackets that include a single 

byte subpacket type identifier, followed by a 2 -byte length field (including the 
length of the type and length fields) and finally including the data that is to be 
sent. One embodiment will store all values in network byte order. Other byte 
orders will be apparent to those skilled in the art. The following data structure 

1 5 definition describes some embodiments. 

class SubPacket { 
/* 

* The type identifier for this subpacket. 
20 */ 

uint8_t mType; 

/* 

* The length of this subpacket, in network byte order. 
25 */ 

uintl6_t mLength; 

} ; 

One embodiment of this invention will include data describing a 
30 momentary snapshot of the measurement statistics for a given path between a 
sender and a receiver. 

In some embodiments of this invention, this data will include one or 
more of the following information: the source and destination IP addresses that 
define the path, a measurement packet size for which the statistics have been 
35 calculated as well as computed measurement statistics that are at least partly 
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responsive to delay; computed measurement statistics that are at least partly 
responsive to jitter and computed measurement statistics that are at least partly 
responsive to packet loss. 

In one embodiment of this invention, these statistics will be in units of 
5 microseconds expressed as 64-bit floating-point quantities and transmitted in a 
standard network byte order. 

In one embodiment of this invention, the following data structure will 
store the computed statistics: 

10 class TunnelStatsSubPacket : public SubPacket { 

/ * * 

* The time that this statistic snapshot was taken (in 

* microseconds since 1970). 
**/ 

15 uint64_t mTimestamp; 

* The source IP address of the tunnel these statistics 
apply 

20 * to. 

* * I 

uint32__t mSrcAddr; 

I -k ~k 

25 * The destination IP address of the tunnel these 

statistics 

* apply to. 

* * / 

uint32_t mDstAddr; 

30 

/ * * 

* The size of measurement packet that these statistics 
apply 

* to. A size of 0 indicates that these statistics 
35 apply to 

* all packet sizes. 

* * / 

uintl6 t mPktSize; 



42 



.in s o :7' so a, ,:. :t es o so. a 



/ + * 

* The average delay in microseconds. 

* * / 

5 double mDelay; 

/ * * 

* The average jitter in microseconds. 

* * / 

10 double mJitter; 

/ * * 

* The percentage of packets that have been lost, in the 
range 

15 * 0 to 1. 

** / 

double mLoss; 

}; 

20 Some embodiments of this invention include the time at which the 

statistics were computed such that those statistics are sent over multiple paths 
for improved reliability and to take advantage of one path having less delay than 
another. One embodiment at the receiving end is able to compare the 
computation times of received statistics to place them in their original temporal 

25 order, regardless of their relative arrival times over the paths. 

Some embodiments of this invention will send computed statistics 
specific to the paths that are part of the plurality of one or more paths that are 
between the specific sender and receiver. Other embodiments will send 
additional computed statistics for paths that are not one of the plurality of one or 

30 more paths that are between the specific sender and receiver. 

Some embodiments of this invention will include network information 
concerning network topology including but not limited to information retrieved 
from routers such as in-bound or out-bound link utilization, inbound or out- 
bound link bandwidth and/or CPU utilization. Other network information 

35 determined from routers and other network devices will be apparent to someone 
skilled in the art. 
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Some embodiments of this invention will also include control data 
directing a receiver of the measurement packet to change one or more 
configuration parameters of the receiver. 

In some embodiments of the invention, the control data will instruct a 
5 receiver to alter its configuration, including but not limited to zero or more of 
the following examples: instructing a receiver to initiate sending a plurality of 
one or more measurement packets, change one or more of the measurement 
packet sizes, inter-measurement packet transmission times and mix of packet 
sizes, and stop sending one or more of the plurality of measurement packets. 
10 In some embodiments of the invention, this control information will 

include notification of measurement devices that have joined or left the 
network. 

In many embodiments of the invention, the measurement packets will be 
encrypted by the sender and decrypted by the receiver. Some of these 
1 5 embodiments will use IPSec. 

In some embodiments of the invention, the encryption and decryption 
will be done by an external device using IPSec. 

Other encryption and decryption options will be apparent to one skilled 
in the art. 

20 In some embodiments of the invention, the measurement packets will be 

digitally signed. 

In some embodiments of the invention, a generic communication 
channel will be used by a sender and a receiver to communicate data between 
them. 

25 Performance Characteristics of a Path 

Measurements are used to compute performance characteristics of the 
paths traversed by the measurement packets. The measurements can either be 
computed from the measurement packets themselves, or extracted from the 
arbitrary data carried by the measurement packets. The measurements of 
30 performance characteristics include at least one or more of one-way 

measurements and round-trip measurements. The performance characteristics 
include at least one or more reachability, delay, jitter, loss, available bandwidth, 
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and total bandwidth. Other performance characteristics will be apparent to those 
skilled in the art. 

In some embodiments of the invention, delay measurements are 
computed as the interval of time from the moment the measurement packet is 
5 sent by the sender to the moment of time the measurement packet is received by 
the receiver. The sending time is carried by the packet, and it is measured by the 
clock the sender refers to. The receiving time is measured by a clock that the 
receiver refers to, which may or may not be synchronized with the sender's 
clock. . 

10 In some embodiments of the invention, the clock of the sender and the 

clock of the receiver are synchronized. A plurality of one or more precise clock 
inputs such as GPS, NTP, IRIG and NIST will be used. Some embodiments of 
this invention will use the same clock as an input to more than one of the 
plurality of one or more senders and receivers. In some embodiments of the 

1 5 invention, the clock of the sender and the clock of the receiver are the same. 

In some embodiments of the invention, the clock of the sender and the 
clock of the receiver are not synchronized, and mechanisms based on the 
measurement data are used to correct the clock skew and clock drift, the 
mechanisms including using minimum delay across multiple measurement 

20 samples, and using a mechanism to track the minimum delay over time. 

Some embodiments of the invention will use the minimum round-trip 
delay between the devices to place a lower bound on clock skew. 

Some embodiments of the invention will use the lower bound of 
multiple paths between the sender and receiver to further reduce the lower 

25 bound. 

Some embodiments of the invention will correct for clock drift by 
tracking the relative clock skew between the sender and receiver over time and 
adjusting for the slope of the drift. 

In some embodiments of the invention, jitter measurements, also known 
30 as inter-measurement packet delay variations, are computed as the difference in 
delay on consecutive, successfully received packets. 



45 



JLQi 2 W7 : H O :ii w :i! E! O 5 IE I! E! 



In some embodiments of the invention, jitter can also be computed as 
the difference between the instantaneous delay of a packet, and the average 
delay of all the measurement packets previously received. 

In some embodiments of the invention, loss measurements are computed 
5 by assigning a timeout value to each measurement packet that indicates the 

instant of time after which the measurement packet will be declared lost, if the 
packet has not arrived by that time. In some embodiments of the invention, the 
timeout value of a measurement packet can be computed with the transmission 
time of a previously received packet, an estimation of the inter-transmission 

10 time between measurement packet, and an estimation of the transmission delay 
of the measurement packet. In some embodiments of the invention, the inter- 
transmission time can be estimated if the receiver knows about the scheduling 
pattern of transmission of measurement packets. In some embodiments of the 
invention, the transmission delay of packet can be estimated based on delay and 

1 5 jitter performance characteristics. 

Performance characteristics of a path could be the measurement 
themselves, or statistics on those measurements. In the statistics case, a dynamic 
algorithm is used to updates the statistics associated with a path with every new 
measurement obtained with the arrival of every new packet over the path. 

20 In some embodiments of the invention, the algorithm computes statistics 

over the performance characteristics of the path. 

In some embodiments of the invention, the statistics include averages, 
deviations, and variances. Other statistics will be apparent to those skilled in the 
art. In some embodiments of the invention, averages can be computed using a 

25 plurality of one or more techniques including a moving average, an average 
based on the Robbins-Moro estimator, a window-based average or a bucket- 
based average. Other techniques to compute averages will be apparent to those 
skilled in the art. 

In some embodiments of the invention, the moving average is an 
30 exponentially moving average computed using a Robbins-Moro estimator. The 
Robbins-Moro stochastic approximation estimator finds a solution of the 
equation: 
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Efflt)-x] = 0 

where E is the expectation, f(t) a function and x the estimator. The general form 
of the solution is: 

x(t) = x(t-l) + alpha * [f(t) -x(t- 1) ] = (1 - alpha) * x(t-l) + alpha *f(t) 
5 or, with alpha = (1 - ju), 

x = // * x + (7 - ju) *f 

ju is the weight of the estimator, and determines the amount contributed to the 
average by the function.. In some embodiments of the invention, ju is constant. 
10 In some embodiments of the invention, ju is a dynamic value, whose value 
depends on the last value of the function f according to the formula: 

ju = e*(-f/K) 

where K is a constant that also determines the importance of the last value of / 
with respect to the current value of the estimator x. 

15 In some embodiments of the invention, average delay can be computed using an 
exponentially moving average as follows, 

d = ju*d+(]-ju)*m 

20 where d is the exponentially moving average of delay, m is the last delay 
sample, and ju is the weight of the moving average. 

In some embodiments of the invention, average jitter can be computed using an 
exponentially moving average as follows, 

25 

v = ju * v + (J - ju) * \d - m\ 

where v is the exponentially moving average of jitter, \d - m\ is the last sample 
of jitter, and ju is the weight of the average. 

30 
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In some embodiments of the invention, average jitter can be computed using an 
exponentially moving average as follows, 

v = ju * v + (1 - ju) * \m — m'\ 

5 

Where v is the exponentially moving average of jitter, \m — m'\ is the last 
sample of jitter, m is the last delay sample, m ' is the previous delay sample, and 
// is the weight of the average. 

In some embodiments of the invention, delay and jitter averages can be 
10 combined into a single value as follows: 

l = d + M*v 

Where d is the average delay, v is the average jitter and Mis a constant. 

15 In some embodiments of the invention, average loss can be computed using an 
exponentially moving average as follows, 

p-hat = ju * p-hat + (1 - ju) *p 

where p-hat is the moving average of the loss, p = {0 if packet is received, 1 is 
the packet is declared lost}, and ju is the weight of the exponentially moving 
20 average. 

In some embodiments of the invention, // is determined based on the 
notion of forgiveness against a single packet loss. The forgiveness period is the 
interval of time between the time the packet loss occurs and the time the 
average loss is forgiven. The forgiveness period can be either defined in units of 
25 time, or in number of packets if the rate of the monitoring flow is known. That 
is, the forgiveness period will end after n consecutive packets have been 
received after the loss, when these packets have been transmitted at a certain 
rate. 

The value of the exponentially moving average after receiving the n 
30 packets is needed before // can be determined, and this value is known as the 
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forgiveness threshold. In some embodiments of the invention, the forgiveness 
threshold is chosen arbitrarily. In some embodiments of the invention, the 
forgiveness threshold takes the value: - 

K(l-li) 

5 This value is half of the value of the estimator after the singe loss occurs, and 
thus we call it the half-life threshold. Similarly, we also call the forgiveness 
period under this threshold the half life period. The advantage of using a 
forgiveness threshold greater than zero is that issues related to host-dependent 
floating-point representations reaching that value are avoided. 

10 In some embodiments of the invention, ju is computed by comparing the value 
of the estimator after n consecutive packet arrivals since the loss with the half- 
life threshold: 

p-hat = (1- p) * // A « < V 2 ( 1 - ju) 

Given that n is known because is determined by the value of the half life period 
1 5 and the transmission rate, ju is computed as: 

ju = exp ((In Vi) / n) 

In some embodiments of the invention, two thresholds are defined, an upper 
threshold and a lower threshold. When the value of p-hat exceeds the upper 
threshold, the loss is not forgiven until enough measurement packets are 
20 received consecutively so that the value of p-hat gets below the lower threshold. 

Other mechanisms to compute // will be apparent to for those skilled in the art. 

Path Description 

In some embodiments of the invention, the path traversed by the 
25 measurement packets from the sender to the receiver is such that the path is at 
least partly implemented with at least one of a GRE tunnel, an IPSEC tunnel 
and IPonIP tunnel. Other path implementations using tunnel will be apparent for 
those skilled in the art. 
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In some embodiments of the invention, the path traversed by the 
measurement packets from the sender to the receiver is implemented with a 
virtual circuit, including a frame relay PVC, an ATM PVC or MPLS. Other path 
implementations using virtual circuits will be apparent for those skilled in the 
art. 

Other path implementations will be apparent to those skilled in the art. 

Internetwork Description 

In some embodiments of the invention, the internetwork is implemented 
by a plurality of one or more subnetworks, including a plurality of one or more 
VPNs, a plurality of one or more BGP autonomous systems, a plurality of one 
or more local area networks, a plurality of one or metropolitan area networks, 
and a plurality of one or morewide area networks. 

In some embodiments of the invention, the internetwork is implemented 
by an overlay network. 

Other internetwork implementations will be apparent to those skilled in 

the art. 

Packet Sizes and Transmission Times 

In some embodiments of the invention, the measurement packets are of 
varying sizes, including 64, 256, 512, 1024, 1500 bytes. 

In some embodiments of the invention, the size of the measurement 
packets is specified with an external API. 

In some embodiments of the invention, the measurement packets are of a 
fixed size. 

In some embodiments of the invention, the measurement packet sizes 
and times between measurement packets simulate the traffic pattern of a 
plurality of one or more applications 

In some embodiments of the invention, traffic patterns correspond to voice 
applications, where the packets re of small size, e.g., 30 bytes, and the inter- 
transmission time between consecutive packets is constant, e.g., 10 ms. These 
examples do not limit the possible size values and inter-transmission time 
values. 
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In some embodiments of the invention, traffic patterns correspond to 
video applications, where the packets size is the largest permitted to be 
transmitted by an internetwork without being fragmented, and the inter- 
transmission time between consecutive packets varies depending on the spatial 
5 and temporal complexity of the video content being transmitted, the 
compression scheme, the encoding control scheme. 

In some embodiments of the invention, traffic patterns correspond to the 
plurality of applications observed in an internetwork, including at least one or 
more of HTTP transactions, FTP downloads, IRC communications, NNTP 
10 exchanges, streaming video sessions, VoIP sessions, videoconferencing sessions 
and e-commerce transactions. Other types of applications will be apparent to 
those skilled in the art. 

In some embodiments of the invention, the inter-measurement packet 
transmission times are of varying length. 
15 In some embodiments of the invention, the inter-measurement packet 

transmission times are of fixed length. 

In some embodiments of the invention, the inter-measurement packet 
transmission times specified with an external API. 

In some embodiments of the invention, the length of the inter- 
20 measurement packet transmission times is randomized according to a 

distribution. In some embodiments of the invention, this distribution is based at 
least in part on a uniform distribution. In some embodiments of the invention, 
this distribution is based at least in part on an exponential distribution. In some 
embodiments of the invention, this distribution is based at least in part on a 
25 geometric distribution. Other distributions will be apparent to those skilled in 
the art. 

In some embodiments of the invention, the length of the inter- 
measurement packet transmission times is provided by a table. 

In some embodiments of the invention, the length of the inter- 
30 measurement packet transmission times is controlled by a scheduler. In some 
embodiments of the invention, the scheduler uses a priority queue, keyed on 
desired send time. 
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Other mechanisms to specify the inter-measurement packet transmission 
time will be apparent to those skilled in the art. 

Other packet sizes and transmission times will be apparent to those 
skilled in the art. 
5 Path Selection 

It is possible that multiple alternative paths between a sender and a 
receiver are available through an internetwork at any given moment. 
Performance characteristics of each of these paths can be used to select a subset 
of the paths. 

10 In some embodiments of the invention, the subset of the plurality of 

paths is selected based at least in part on at least one of: one or more of the 
measurement statistics from the measurement packet and one or more of the 
computed statistics. 

In some embodiments of the invention, the selection of the subset of the 

1 5 plurality of paths is based at least partly on the position of paths in a ranking. In 
some embodiments of the invention, the ranking is at least partly based on one 
or more of the measurement statistics included as data in the measurement 
packet. In some embodiments of the invention the ranking is at least partly 
based on the computed statistics of the path. In some embodiments of the 

20 invention the ranking is implemented by using a comparison function to 
compare the paths, and by ordering the paths in a decreasing order. In some 
embodiments of the invention the ranking is implemented by using a 
comparison function to compare the paths, and by ordering the paths in an 
increasing order. Other ranking techniques will be apparent to those skilled in 

25 the art. 

In some embodiments of the invention, the ranking is based on a single 
score associated to each path. In some embodiments of the invention, this score 
is denoted Magic Score (MS), and it is computed as follows: 

30 MS = ML * MF 

ML = d + M * v 
MF = delta * p-hat + 1 
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where ML is the Magic Latency, a component of the MS obtained using delay 
and jitter respectively calculated with statistics; and MF is the Magic scaling 
Factor that multiplies the value of ML, and is computed based on loss statistics. 
M is a constant that takes several values, including 4, for example. MS can be 
5 seen as a scaled-up version of ML, and the scaling factor MF is a function of p- 
hat and delta, a constant. As p-hat not only reflects loss but also detects large 
delay spikes before they happen, p-hat can be seen as an indicator of the 
departure of the path from a "normal mode" operation, and thus the scaling 
factor is only applied when there are loss or spikes. The goal of MF is to 
10 differentiate between paths that have very similar delay characteristics, but with 
one having losses and the other not having them. 

In some embodiments of the invention, ML is used as a delay indicator, 
given that jitter is accounted as an increase in delay. In contrast, MS, although a 
scaled version of ML, cannot be used to indicate delay, except when MF = 1 (p- 
1 5 hat = 0), which leads to MS = ML. That means the value of MS is useful not by 
itself but to compare it with the MSs of other tunnels. 

In some embodiments of the invention, loss statistics can be used as a 
discriminator instead of a scaling factor. That is, p-hat can eliminate paths 
experimenting loss. Then, the remaining paths can be selected using MS = ML. 

20 In some embodiments of the invention, the selection of a subset of paths 

is based on applying at least one or more thresholds to at least one of more of 
the statistics. 

In some embodiments of the invention, a single threshold is used, and 
computed as a certain percentage of the highest score of the paths. In some 
25 embodiments of the invention, the threshold is determined by subtracting a 
fixed quantity to the highest score of the paths. 

In some embodiments of the invention, the number of paths in the subset 
of paths is fixed. In some embodiments of the invention, this fixed number of 
paths N out of M paths is determined such that the probability of having loss in 
30 (M - N) paths simultaneously is less than a certain threshold. In some 
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embodiments of the invention, this probability is a binomial, with the 
assumption that all paths have the same probability of loss. 

In some embodiments of the invention, the selection of the subset of the 
plurality of paths is based at least partly on a probability associated with each 
path. In some embodiments of the invention, the probability of each path is at 
least partly based on one or more of the measurement statistics included as data 
in the measurement packet. 

In some embodiments of the invention, the probabilities of each path are 

equal. 

In some embodiments of the invention, the selection of the subset of the 
plurality of paths is based at least partly on the cost of the path. 

In some embodiments of the invention, the selection of the subset of the 
plurality of paths is based at least partly on the amount of bandwidth consumed 
over a period of time. 

Other possibilities to compute path probabilities will be apparent to 
those skilled in the art. 

Other mechanisms to select a subset of the paths will be apparent to 
those skilled in the art. 
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